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SEQUENCE LISTING 

{1} GENERAL INFORMATION i 

(i) APPLICANTS: Petkovich, P, Martin, White, Jay A., 

Beckett, Barbara R. , Jones, Glenville 

Cii) TITLE OF INVENTION; Retinoid Metabolizing Protein 

Uii) NUMBER OF SEQUENCES: 43 

(iv) CORRESPONDENCE ADDRESS: 
(A) ADDRESSEE: Torys LLP 

(E) STREET; 3000 - 79 Wellington Street West 

(C) CITY: Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) ZIP: M5K 1N2 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3 1/2 inch, 1 . 4 Mb 9torage 

(B) COMPUTER: COMPAQ, IBM PC compatible 
{C) OPERATING SYSTEM: MS-DOS 5,1 

(D) SOFTWARE: WORD PERFECT 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER J 09/668,482 

(B) FILING DATE: September 25, 2000 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBERS: 08/667,546; OB/724,466; PCT/CA97/00440; 
(BJ FILING DATE: June 21, 1996; October 1, 1996; June 23, 1997; 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME ; Hunt, John C. 

(B) REGISTRATION NUMBER \ 36,424 

(C) REFERENCE/DOCKET NUMBER: 32391-2005 

(ix) TELECOMMUNICATION INFORMATION; 

(A) TELEPHONE: (416) 865-8121 

(B) TELEFAX : (416) 865-73B0 



(2) INFORMATION FOR SEQ ID NO: I 

(i) SEQUENCE CHARACTERISTICS : ■ 

(A) LENGTH: 337 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 

TGCCAGTGGA CAATCTCCCT ACCAAATTCA CTAGTTATGT CCAGAAATTA GCCTAAACCG 60 

GAGCCTTTGT ACATATGTTT TTATTTTAGA TGAACTGTGA TGTATTGGAT ATTTTCTAAT 120 

TTGTTTATAT AAAGCAGATG TGTATATAAG TCTATGCGAA GAAGCGAAAA CGAGGGCACT 180 
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ACTTTCTCAT GGATCACTGT AATGCTACAG AGTGTCTGTG ATGTATATTT ATAATGTAGT 240 
TGTGTCATAT AGCTTTTGTA CTGTATGCAA CTTATTTAAC TCGCTCTTTA TCTCATGGGT 300 
TTTATTTAAT AAAACATGTT CTTACAAAAA AAAAAAA 337 



(2) INFORMATION FOR SEQ ID NO: 2 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 492 amino acids 

(B) TYPE: amino acici 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 

Met Gly Leu Tyr Thr Leu Met Val Thr Phe Leu Cys Thr lie Val Leu 
15 10 15 

Pro Val Leu Leu Phe Leu Ala Ala Val Lys Leu Trp Glu Met Leu Met 

20 25 30 

lie Arg Arg Val Asp Pro Asn Cys Arg Ser Pro Leu Pro Pro Gly Thr 
35 40 45 

Met Gly Leu Pro Phe lie Gly Glu Thr Leu Gin Leu He Leu Gin Arg 
50 55 60 

Arg Lya Phe Leu Arg Met Lys Arg Gin Lys Tyr Gly Cys lie Tyr Lys 
65 70 75 80 

Thr His Leu Phe Gly Asn Pro Thr Val Arg Val Met Gly Ala Asp Asn 

85 90 95 

Val Arg Gin He Leu Leu Gly Glu His Lys Leu Val Set Val Gin Trp 

100 105 110 

Pro Ala Ser Val Arg Thr He Leu Gly Ser Asp Thr Leu Ser Asn Val 
115 120 125 

His Gly Val Gin His Lys Asn Lys Lys Lys Ala He Met Arg Ala Phe 
130 135 140 

Ser Arg Asp Ala Leu Glu His Tyr He Pro Val He Gin Gin Glu Val 
145 150 155 160 

Lys Ser Ala He Gin Glu Trp Leu Gin Lys Asp Ser Cys Val Leu Val 

165 170 175 

Tyr Pro Glu Met Lys Lys Leu Met Phe Arg He Ala Met Arg He Leu 

180 185 190 

Leu Gly Phe Glu Pro Glu Gin He Lys Thr Asp Glu Gin Glu Leu Val 
195 200 205 
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Glu Ala Phe Glu 
210 

Val Pro Phe Ser 
225 

His Ser Lys lie 



Asn Glu Asn Glu 

260 

Asn Ser Arg Arg 

275 

Ala Ala Thr Glu 
290 

Ala Thr Ser Leu 
305 

Lys Val Arg Glu 



Pro Gly Lys Gly 

340 

Gly Cys Val He 
355 

Gly Phe Arg val 
370 

Pro Lys Gly Trp 
365 

Ala Asp Val Phe 



Ser Lys Gly Leu 

420 

Gly Gly Ser Arg 
435 

Lys He Phe Leu 
450 

Asn Gly Pro Pro 
4 65 

Asn Leu Pro Thr 



Glu Met He Lys 
215 

Gly Leu Tyr Arg 
230 

Glu Glu Asn He 

245 

Gin Lys Tyr Lys 



Ser Asp Glu Pro 

280 

Leu Leu Phe Gly 
295 

Val Met Phe Leu 
310 

Glu Val Gin Glu 
325 

Leu Ser Met Glu 



Lys Glu Thr Leu 

360 

Ala Leu Lys Thr 
375 

Asn Val He Tyr 
390 

Pro Asn Lys Glu 
405 

Glu Asp Gly Ser 



Met Cys Val Gly 

440 

Val Glu Leu Thr 
455 

Thr Met Lys Thr 
470 

Lys Phe Thr Ser 
485 
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Asn Leu Phe Ser 

220 

Gly Leu Arg Ala 
235 

Arg Lys Lys He 
250 

Asp Ala Leu Gin 

265 

Phe $er Leu Gin 



Gly His Glu Thr 

300 

Gly Leu Asn Thr 
315 

Lys Val Glu Met 

330 

Leu Leu Asp Gin 
345 

Arg He Asn Pro 



Phe Glu Leu Asn 

330 

Ser He Cys Asp 
395 

Glu Phe Gin Pro 
410 

Arg Phe Asn Tyr 
425 

Lys Glu Phe Ala 



Gin His Cys Asn 

460 

Gly Pro Thr Ha 
475 

Tyr Val Arg Asn 
490 



Leu Pro He Asp 



Arg Asn Phe He 

240 

Gin Asp Asp Asp 
255 

Leu Leu lie Glu 

270 

Ala Met Lys Glu 
285 

Thr Ala Ser Thr 



Glu Val Val Gin 

320 

Gly Met Tyr Thr 

335 

Leu Lys Tyr Thr 

350 

Pro Val Pro Gly 
365 

Gly Tyr Gin He 



Thr His Asp Val 

400 

Glu Arg Phe Met 
415 

lie Pro Phe Gly 
430 

Lys Val Leu Leu 
445 

Trp He Leu Ser 



Tyr Pro Val Asp 

480 
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(2) INFORMATION FOR SEQ ID NO: 3 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1850 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE55 : single 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

TGTCGCCGTT GCTGTCGGTT GCTGTCGGAC GCTGTCTCCT CTCCAGAAGC TTGTTTTTCG 60 

TTTTGGCGAT CAGTTGCGCG CTTCAAC ATG GGG CTG TAC ACC CTT ATG GTC ACC 114 

Met Gly Leu Tyr Thr Leu Met Val Thr 
1 5 

TTT CTC TGC ACC ATC GTG CTA CCC GTT TTA CTC TTT CTC GCC GCG GTG 162 
Phe Leu Cys Thr He Val Leu Pro Val Leu Leu Phe Leu Ala Ala Val 
10 15 20 25 

AAG TTG TGG GAG ATG TTA ATG ATC CGA CGA QTC GAT CCG AAC TGC AGA 210 
Lys Leu Trp Glu Met Leu Met lie Arg Arg Val Asp Pro Asn Cys Arg 

30 35 40 

AGT CCT CTA CCG CCA GGT ACC ATG GGC TTG CCG TTC ATT GGA GAA ACG 2 5G 

Ser Pro Leu Pro Pro Gly Thr Met Gly Leu Pro Phe lie Gly Glu Thr 

45 50 55 

CTC CAG CTG ATC CTC CAG AGA AGG AAG TTT CTG CGC ATG AAA CGG CAG 306 
Leu Gin Leu lie Leu Gin Arg Arg Lys Phe Leu Arg Met Lys Arg Gin 
60 65 70 

AAA TAC GGG TGC ATC TAC AAG ACQ CAC CTC TTC GGG AAC CCG ACT GTC 354 
Lys Tyr Gly Cya He Tyr Lys Thr His Leu Phe Gly Asn Pro Thr Val 
15 SO 85 

AGG GTG ATG GGA GCT GAT AAT GTG AGG CAG ATT CTG CTG GGC GAA CAC 402 
Arg Val Met Gly Ala Asp Asn Val Arg Gin He Leu Leu Gly Glu His 
90 95 100 105 

AAG CTG GTG TCT GTT CAG TGG CCA GCA TCA GTG AGA ACC ATC CTG GGC 450 
Lys Leu Val Ser Val Gin Trp Pro Ala Ser Val Arg Thr He Leu Gly 

110 115 120 

TCT GAC ACC CTC TCC AAT GTC CAT GGA GTT CAA CAC AAA AAC AAG AAA 4 98 

Ser Asp Thr Leu Ser Asn Val His Gly Val Gin His Lys Asn Lys Lys 

125 130 X35 

AAG GCC ATT ATG AGG GCG TTC TCT CGA GAT GCT CTG GAG CAC TAC ATT 54 6 

Lys Ala He Met Arg Ala Phe Ser Arg Asp Ala Leu Glu His Tyr He 
140 145 150 

CCC GTG ATC CAG CAG GAG GTG AAG AGC GCC ATA CAG GAA TGG CTG CAA 594 
Pro Val He Gin Gin Glu Val Lys Ser Ala He Gin Glu Trp Leu Gin 
155 160 165 
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AAA GAC TCC TGC GTG CTG GTT TAT CCA GAA ATG AAG AAA CTC ATG TTT 642 
Lys Asp Ser Cys Val Leu Val Tyr Pro Glu Met Lys Lys Leu Met Phe 
170 175 180 185 

CGG ATA GCT ATG AGA ATC CTG CTT GGT TTT GAA CCA GAG CAA ATA AAG 690 
Arg lie Ala Met Arg iJLa Leu Leu Gly Phe Glu Pro Glu Gin lie Lys 

190 195 200 

ACG GAC GAG CAA GAA CTG GTG GAA GCT TTT GAG GAA ATG ATC AAA AAC 73 B 

Thr Asp Glu Gin Glu Leu Val Glu Ala Phe Glu Glu Met He Lys Asn 

205 210 215 

TTG TTC TCC TTG CCA ATC GAC GTT CCT TTC AGT GGT CTG TAC AGG GGT 786 
Leu Phe Ser Leu Pre He Asp Val Pro Phe Ser Gly Leu Tyjr Arg Gly 
220 225 230 

TTG AGG GCA CGC AAT TTC ATT CAC TCC AAA ATT GAG GAA AAC ATC AGG 834 
Leu Arg Ala Arg Asn Phe He His Ser Lys lie Glu Glu Asn He Arg 
235 240 245 

AAG AAA ATT CAA GAT GAC GAC AAT GAA AAC GAA CAG AAA TAC AAA GAC 8 82 

Lys Lys He Gin Asp Asp Asp Asn Glu Asn Glu gin Lys Tyr Lys Asp 
250 255 260 265 

GCC CTT CAG CTG TTG ATC GAG AAC AGC AGA AGA AGT GAC GAA CCT TTT 930 
Ala Leu Gin Leu Leu He Glu Asn Ser Arg Arg Ser Asp Glu Pro Phe 

270 275 290 

AGT TTG CAG GCG ATG AAA GAA GCA GCT ACA GAG CTT CTA TTT GGA GGT 979 
Ser Leu Gin Ala Met Lys Glu Ala Ala Thr Glu Leu Leu Phe Gly Gly 

285 290 295 

CAT GAA ACC ACC GCC AGC ACT GCA ACC TCA CTT GTC ATG TTT CTG GGT 1026 
His Glu Thr Thr Ala Ser Thr Ala Thr Ser Leu Val Met Phe Leu Gly 
300 305 310 

CTG AAC ACA GAA GTG GTG CAG AAG GTC AGA GAG GAG GTT CAG GAG AAG 1074 
Leu Asn Thr Glu Val Val Gin Lys Val Arg Glu Glu Val Gin Glu Lys 
315 320 325 

GTT GAA ATG GGC ATG TAT ACA CCT GGA AAG GGC TTG AGT ATG GAG CTG 1122 
Val Glu Met Gly Met Tyr Thr Pro Gly Lys Gly Leu Ser Met Glu Leu 
330 335 340 345 

TTG GAC CAG CTG AAG TAC ACT GGA TGT GTG ATT AAA GAG ACT CTT AGA 1170 
Leu Asp Gin Leu Lys Tyr Thr Gly Cys Val He Lys Glu Thr Leu Arg 

350 355 360 

ATC AAC CCT CCT GTT CCC GGA GGA TTC AGA GTC GCA CTC AAA ACC TTT 1218 
He Asn Pro Pro Val Pro Gly Gly Phe Arg Val Ala Leu Lys Thr Phe 

365 370 375 

GAA TTG AAT GGT TAC CAA ATT CCT AAA GGA TGG AAC GTC ATT TAC AGC 12 G6 

Glu Leu Asn Gly Tyr Gin lie Pro Lys Gly Trp Asn Val He Tyr Ser 
3B0 385 390 
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ATC TGT GAC ACG CAC GAT GTG GCC GAG GTC TTT CCA AAC AAA GAG GAG 1314 
lie Cys Asp Thr His Asp Val Ala Asp Val Phe Pro Asn Lys Glu Glu 
395 400 405 

TTC CAG CCG GAG AGA TTC ATG AGC AAA GGT CTG GAG GAC GGG TCC AGG 1362 
Phe Gin Pro Glu Arg Phe Met Ser Lys Gly Leu Glu Asp Gly Ser Arg 
410 415 420 425 

TTT AAC TAG ATC CCC TTC GGA GGA GGA TCC AGG ATG TGT GTG GGC AAA 1410 
Phe Asn Tyr lie Prd Phe Gly Gly Gly Ser Arg Met Cys Val Gly Lys 

430 435 440 

GAG TTC GCC AAA GTG TTA CTC AAG ATC TTT TTA GTT GAG TTA ACG CAG 1458 
Glu Phe Ala Lys Val Leu Leu Lys lie Phe Leu Val Glu Leu Thr Gin 

445 450 455 

CAT TGC AAT TGG ATT CTC TCA AAC GGA CCC CCG ACA ATG AAA ACA GGC 1506 
His Cys Asn Trp He Leu Ser Asn Gly Pro Pro Thr Met Lys Thr Gly 
460 465 470 

CCG ACT ATT TAC CCA GTG GAC AAT CTC CCT ACC AAA TTC ACT AGT TAT 1554 
Pro Thr He Tyr Pro Val Asp Asn Leu Pro Thr Lys Phe Thr Ser Tyr 
475 480 485 

GTC AGA AAT TAGCCTAACC GGAGCTTTGT ACATATGTTT TTATTTTAGA 1603 

Val Arg Asn 

490 

TGAACTGTGA TGTATTGGAT ATTTTCTATT TTGTTTATAT AAAGCAGATG TGTATATAAG 1663 

TCTATGCGAG GAAGCGAAAA CGAGGGCACT ACTTTCTCAT GGATCACTGT AAT GC TAC AG 1723 

AGTGTCTGTG ATGTATATTT ATAATGTAGT TGTGTTATAT AGCTTTTGTA CTGTATGCAA 1783 

CTTATTTAAC TCGCTCTTTA TCTCATGGGT TTTATTTAAT AAAACATGTT CTTACAAAAA 184 3 

AAAAAAA 1850 



(2) INFORMATION FOR SEQ ID NO: 4 

(i) SEQUENCE CHARACTERISTICS t 
(A) LENGTH: 4 97 amino acids 
CB) TYPE: amino acid 

(C) STRANDEDNESS: single 

[D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

Met Gly Leu Pro Ala Leu Leu Ala Ser Ala Leu Cys Thr Phe Val Leu 
15 10 15 

Pro Leu Leu Leu Phe Leu Ala Ala lie Lys Leu Trp Asp Leu Tyr Cys 

20 25 30 

Val Ser Gly Arg Asp Arg Ser Cys Ala Leu Pro Leu Pro Pro Gly Thr 
35 40 45 
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Met Gly Phe Pro Phe Phe Gly Glu Thr Leu Gin Met Val Leu Gin Arg 
50 55 60 

Arg Lys Phe Leu Gin Met Lys Arg Arg Lys Tyr Gly Phe He Tyr Lys 
65 70 75 BO 

Thr His Leu Phe Gly Arg Pro Thr Val Arg Val Met Gly Ala Asp Asn 

85 90 95 

Val Arg Arg He Leu Leu Gly Asp Asp Arg Leu Val Ser Val His Trp 

100 105 110 

Pro Ala Ser Val Arg Thr He Leu Gly Ser Gly Cys Leu Ser Asn Leu 
115 120 125 

His Asp Ser Ser His Lys Gin Arg Lys Lys Val He Met Arg Ala Phe 
130 135 140 

Ser Arg Glu Ala Leu Glu Cys Tyr Val Pro Val He Thr Glu Glu Val 
145 150 155 160 

Gly Ser Ser Leu Glu Gin Trp Leu Ser Cys Gly Glu Arg Gly Leu Leu 

165 170 175 

Val Tyr Pro Glu Val Lys Arg Leu Met Phe Arg He Ala Met Arg He 

160 185 190 

Leu Leu Gly Cys Glu Pro Gin Leu Ala Gly Asp Gly Asp Ser Glu Gin 

195 200 205 

Gin Leu Val Glu Ala Phe Glu Glu Met Thr Arg Asn Leu Phe Ser Leu 
210 215 220 

Pro He Asp Val Pro Phe Ser Gly Leu Tyr Arg Gly Met Lys Ala Arg 
225 230 235 240 

Asn Leu He His Ala Arg He Glu Gin Asn He Arg Ala Lys He Cys 

245 250 255 

Gly Leu Arg Ala Ser Glu Ala Gly Gin Gly Cys Lys Asp Ala Leu Gin 

2€0 265 270 

Leu Leu He Glu His Ser Trp Glu Arg Gly Glu Arg Leu Asp Met Gin 

275 280 285 

Ala Leu Lys Gin Ser Ser Thr Glu Leu Leu Phe Gly Gly His Glu Thr 
290 295 300 

Thr Ala Ser Ala Ala Thr Ser Leu lie Thr Tyr Leu Gly Leu Tyr Pro 
305 310 315 320 

His Val Leu Gin Lys Val Arg Glu Glu Leu Lys Ser Lys Gly Leu Leu 

325 330 335 

Cys Lys Ser Asn Gin Asp Asn Lys Leu Asp Met Glu He Leu Glu Gin 

340 345 350 
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Leu Lys Tyr lie Gly Cys Val lie Lys Glu Thr Leu Arg Leu Asn Pro 
355 360 365 

Pro Val Pro Gly Gly Phe Arg Val Ala Leu Lys Thr Phe Glu Leu Asn 
370 375 380 

Gly Tyr Gin lie Pro Lys Gly Trp Asn Val lie Tyr Ser lie Cys Asp 
385 390 395 400 

Thr His Asp Val Ala Glu He Phe Thr Asn Lys Glu Glu Phe Asn Pro 

405 410 415 

Asp Arg Phe Ser Ala Pro His Pro Glu Asp Ala Ser Arg Phe Ser Phe 

420 425 430 

lie Pro Phe Gly Gly Gly Leu Arg Ser Cys Val Gly Lys Glu Phe Ala 
435 440 445 

Lys lie Leu Leu Lys lie Phe Thr Val Glu Leu Ala Arg fiis Cys Asp 
450 455 460 

Trp Gin Leu Leu Asn Gly Pro Pro Thr Met Lys Thr Ser Pro Thr Val 
465 470 475 460 

Tyr Pro Val Asp Asn Leu Pro Ala Arg Phe Thr His Phe His Gly GlU 

485 490 495 

He 



(2) INFORMATION FOR SEQ ID NO; 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1494 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

ATG GGG CTC CCG GCG CTG CTG GCC AGT GCG CTC TGC ACC TTC GTG CTG 48 
Met Gly Leu Pro Ala Leu Leu Ala Ser Ala Leu Cys Thr Phe Val Leu 
15 10 15 

CCG CTG CTG CTC TTC CTG GCT GCG ATC AAG CTC TGG GAG CTG TAC TGC 96 
Pro Leu Leu Leu Phe Leu Ala Ala He Lys Leu Trp Asp Leu Tyr Cys 

20 25 30 

GTG AGC GGC CGC GAC CGC AGT TGT GCC CTC CCA TTG CCC CCC GGG ACT 14 4 

Val Ser Gly Arg Asp Arg Ser Cys Ala Leu Pro Leu Pro Pro Gly Thr 
35 40 45 

ATG GGC TTC CCC TTC TTT GGG GAA ACC TTG CAG ATG GTA CTG CAG CGG 192 
Met Gly Phe Pro Phe Phe Gly Glu Thr Leu Gin Met Val Leu Gin Arg 
50 55 60 
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AGG AAG TTC CTG CAG ATG AAG CGC AGG AAA TAG GGC TTC ATC TAG AAG 240 
Arg Lys Phe Leu Gin Met Lys Arg Arg Lys Tyr Gly Phe He Tyr Lys 
65 70 75 BO 

ACG CAT CTG TTC GGG CGG CCC ACC GTA CGG GTG ATG GGC GCG GAC AAT 288 
Thr His Leu Phe Gly Axg Pro Thr Val Arg Val Met Gly Ala Asp Asn 

85 90 95 

GTG CGG CGC ATC TTG CTC GGA GAC GAC CGG CTG GTG TCG GTC CAC TGG 336 
Val Arg Arg He Leu Leu Gly Asp Asp Arg Leu Val Ser Val His Trp 

100 105 110 

CCA GCG TCG GTG CGC ACC ATT CTG GGA TCT GGC TGC CTC TCT AAC CTG 364 
Pro Ala Ser Val Arg Thr He Leu Gly Ser Gly Cys Leu Ser Asn Leu 
115 120 125 

CAC GAC TCC TCG CAC AAG CAG CGC AAG AAG GTG ATT ATG CGG GCC TTC 4 32 

His Asp Ser Ser His Lys Gin Arg Lys Lys Val He Met Arg Ala Phe 
130 135 140 

AGC CGC GAG GCA CTC GAA TGC TAC GTG CCG GTG ATC ACC GAG GAA GTG 4 80 

Ser Arg Glu Ala Leu Glu Cys Tyr Val Pro Val He Thr Glu Glu Val 
145 150 155 160 

GGC AGC AGC CTG GAG CAG TGG CTG AGC TGC GGC GAG CGC GGC CTC CTG 528 
Gly Ser Ser Leu Glu Gin Trp Leu Ser Cys Gly Glu Arg Gly Leu Leu 

165 HO 175 

GTC TAC CCC GAG GTG AAG CGC CTC ATG TTC CGA ATC GCC ATG CGC ATC 57 6 

Val Tyr Pro Glu Val Lys Arg Leu Met Phe Arg He Ala Met Arg He 

180 185 190 

CTA CTG GGC TGC GAA CCC CAA CTG GCG GGC GAC GGG GAC TCC GAG CAG 624 
Leu Leu Gly Cys Glu Pro Gin Leu Ala Gly Asp Gly Asp Ser Glu Gin 
195 200 205 

CAG CTT GTG GAG GCC TTC GAG GAA ATG ACC CGC AAT CTC TTC TCG CTG 672 
Gin Leu Val Glu Ala Phe Glu Glu Met Thr Arg Asn Leu Phe Ser Leu 
210 215 220 

CCC ATC GAC GTG CCC TTC AGC GGG CTG TAC CGG GGC ATG AAG GCG CGG 720 
Pro He Asp Val Pro Phe Ser Gly Leu Tyr Arg Gly Met Lys Ala Arg 
225 230 235 240 

AAC CTC ATT CAC GCG CGC ATC GAG CAG AAC ATT CGC GCC AAG ATC TGC 7 6B 

Asn Leu He His Ala Arg He Glu Gin Asn He Arg Ala Lys He Cys 

245 250 255 

GGG CTG CGG GCA TCC GAG GCG GGC CAG GGC TGC AAA GAC GCG CTG CAG 816 
Gly Leu Arg Ala Ser clu Ala Gly Gin Gly Cys Lys Asp Ala Leu Gin 

260 265 270 

CTG TTG ATC GAG CAC TCG TGG GAG AGG GGA GAG CGG CTG GAC ATG CAG 8 64 

Leu Leu He Glu His Ser Trp Glu Arg Gly Glu Arg Leu Asp Met Gin 
275 280 285 
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GCA CTA AAG CAA TCT TCA ACC GAA CTC CTC TTT GGA GGA CAC GAA ACC 912 
Ala Leu Lys Gin Ser Ser Thr Glu Leu Lea Phe Gly Gly His Glu Thr 
290 295 300 

ACG GCC AGT GCA GCC ACA TCT CTG ATC ACT TAC CTG GGG CTC TAG CCA 960 
Thr Ala Ser Ala Ala Thr Ser Leu lie Thr Tyr Leu Gly Leu Tyr Pro 
305 310 315 320 

CAT GTT CTC CAG AAA GTG CGA GAA GAG CTG AAG AGT AAG GGT TTA CTT 1006 
His Val Leu Gin Lys Val Arg Glu Glu Leu Lys Ser Lys Gly Leu Leu 

325 330 335 

TGC AAG AGC AAT CAA GAC AAC AAG TTG GAC ATG GAA ATT TTG GAA CAA 1056 
Cys Lys Ser Asn Gin Asp Asn Lys Leu Asp Met Glu lie Leu Glu Gin 

340 345 350 

CTT AAA TAC ATC GGG TGT GTT ATT AAG GAG ACC CTT CGA CTG AAT CCC 1104 
Leu Lys Tyr lie Gly Cys Val He Lys Glu Thr Leu Arg Leu Asn Pro 
355 360 365 

CCA GTT CCA GGA GGG TTT CGG GTT GCT CTG AAG ACT TTT GAA TTA AAT 1152 
Pro Val Pro Gly Gly Phe Arg Val Ala Leu Lys Thr Phe Glu Leu Asn 
370 375 380 

GGA TAC CAG ATT CCC AAG GGC TGG AAT GTT ATC TAC AGT ATC TGT GAT 1200 

Gly Tyr Gin He Pro Lys Gly Trp Asn Val He Tyr ser He Cys Asp 
385 390 395 400 

ACT CAT GAT GTG GCA GAG ATC TTC ACC AAC AAG GAA GAA TTT AAT CCT 1246 
Thr His Asp Val Ala Glu He Phe Thr Asn Lys Glu Glu Phe Asn Pro 

405 410 415 

GAC CGA TTC AGT GCT CCT CAC CCA GAG GAT GCA TCC AGG TTC AGC TTC 1296 
Asp Arg Phe Ser Ala Pro His Pro Glu Asp Ala Ser Arg Phe Ser Phe 

420 425 430 

ATT CCA TTT GGA GGA GGC CTT AGG AGC TGT GTA GGC AAA GAA TTT GCA 134 4 

He Pro Phe Gly Gly Gly Leu Arg Ser Cys Val Gly Lys Glu Phe Ala 
435 440 445 

AAA ATT CTT CTC AAA ATA TTT ACA GTG GAG CTG GCC AGG CAT TGT GAC 1392 
Lys lie Leu Leu Lys He Phe Thr Val Glu Leu Ala Arg His Cys Asp 
450 455 460 

TGG CAG CTT CTA AAT GGA CCT CCT ACA ATG AAA ACC AGT CCC ACC GTG 1440 
Trp Gin Leu Leu Asn Gly Pro Pro Thr Met Lys Thr Ser Pro Thr Val 
465 470 475 480 

TAT CCT GTG GAC AAT CTC CCT GCA AGA TTC ACC CAT TTC CAT GGG GAA 1488 
Tyr Pro Val Asp Asn Leu Pro Ala Arg Phe Thr His Phe His Gly Glu 

485 490 495 

ATC TGA 14 94 

lie 
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(2) INFORMATION FOR SEQ ID NO: 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(d) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

Pro Phe Gly Gly Gly Pro Arg Leu Cys Pro Gly Tyr Glu Leu Ala Arg 
15 10 15 

Val Ala Leu Ser 

20 



(2) INFORMATION FOR SEQ ID NO: 7 

(i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY r linear 

(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 7 

Pro Phe Ser Gly Gly Ala Arg Asn Cys He Gly Lys Gin Phe Ala Met 
1 5 10 15 

Ser Glu Met Lys 

20 



(2) INFORMATION FOR SEQ ID NO: 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 
Pro Phe Ser Gly Gly Ala Arg Asn Cys He Gly Lys Gin Phe Ala Met 
15 10 15 

Asn Glu Leu Lys 

20 



(2) INFORMATION FOR SEQ ID NO : 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 



PAGE 26143 * RCVD AT 6/22/2004 5:27:52 PM [Eastern Daylight Time] ' SVR:USPTO-EFXRF-3/26 * DNIS:2730d41 * CSID:416 865 7380 * DURATION (mm-ss):10-38 



JUN-22-2004 17:40 



TORYS LLP TORONTO 



416 865 7380 P. 



- 12/28 - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 

Pro Phe Gly Thr Gly Pro Arg Asn Cys lie Gly Met Arg Phe Ala lie 
15 10 15 

Met Asn Met Lys 

20 



(2) INFORMATION FOR SEQ ID NO: 10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

Pro Phe Ser Gly Gly Ser Arg Asn Cys He Gly Lys Gin Phe Ala Met 
15 10 15 

Asn Glu Leu Lys 

20 



(2) INFORMATION FOR SEQ ID NO: 11 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 351 base pairs 
(E) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 



GAACTCCTCT TTGGAGGACA CGAAACCACG GCCAGTGCAG CCACATCTCT GATCACTTAC 60 

CTGGGGCTCT ACCCACATGT TCTCCAGAAA GTGCGAGAAG AGCTGAAGAG TAAGGGTTTA 120 

CTTTGCAAGA GCAATCAAGA CAACAAGTTG GACATGGAAA TTTTGGAACA ACTTAAATAC 180 

ATCGGGTGTG TTATTAAGGA GACCCTTCGA CTGAATCCCC CAGTTCCAGG AGGGTTTCGG 240 

GTTGCTCTGA AGACTTTTGA ATTAAATGGA TACCAGATTC CCAAGGGCTG GAATGTTATC 300 

TACAGTATCT GTGATACTCA TGATGTGGCA GAGATCTTCA CCAACAAGGA A 351 



(2) INFORMATION FOR SEQ ID NO; 12 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
TTTTTTTTTT TTGG 14 
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(2) INFORMATION FOR SEQ ID NO: 13 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
TTTTTTTTTT TTGA 14 



(2) INFORMATION FOR SEQ ID NO: 14 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 14 base pairs 
<B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

<D} TOPOLOGY! linear 
(Xi) SEQUENCE DESCRIPTION 1 SEQ ID NO ; 14 
TTTTTTTTTT TTGT 14 



(2) INFORMATION FOR SEQ ID NO: 15 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH; 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
TTTTTTTTTT TTGC 14 



(2) INFORMATION FOR SEQ ID NO: 16 

(i) SEQUENCE CHARACTERISTICS r 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
TTTTTTTTTT TTAG 14 



(2) INFORMATION FOR SEQ ID NO: 17 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
TTTTTTTTTT TTAA 14 

(2) INFORMATION FOR SEQ ID NO: 18 

(i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH i 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
TTTTTTTTTT TTAT 14 

(2) INFORMATION FOR SEQ ID NO:l£ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO; 19 
TTTTTTTTTT TTAC 14 

(2) INFORMATION FOR SEQ ID NO:20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NQ:20 
TTTTTTTTTT TTCG 14 

(2) INFORMATION FOR SEQ ID NO; 21 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY : linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

TTTTTTTTTT TTCA 14 



PAGE 29/43 * RCVD AT 6/22/2004 5:27:52 PM [Eastern Daylight Time] ' SVR:USPTO-EFXRF-3126 1 DNIS:2730941 < CSID:416 865 7380 * DURATION (mm-ss):1 0-38 



JUN-22-2004 17:41 



TORYS LLP TORONTO 



416 S65 7380 P 



- 15/28- 



(2) INFORMATION FOR SEQ ID NO: 22 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION ; SEQ ID NO;22 
TTTTTTTTTT TTCT 14 



(2) INFORMATION FOR SEQ ID NO: 23 

(i) SEQUENCE CHARACTERISTICS: 
{A> LENGTH: 14 base pairs 
{ B ) TYPE: nucleic acid 
(Ci STRANDEDNESS: single 
(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO; 23 

TTTTTTTTTT TTCC 14 



(2) INFORMATION FQR SEQ ID NO: 24 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

AAGCGACCGA 10 



(2) INFORMATION FOR SEQ ID NO: 25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) type: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
TGTTCGCCAG 10 



(2) INFORMATION FOR SEQ ID NO: 26 

(i) SEQUENCE CHARACTERISTICS; 

{A) LENGTH : 10 base pairs 

(B) TYPE: nucleic acid 

{C> STRANDEDNESS: single 

(D) TOPOLOGY; linear 
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Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
TGCCAGTGGA 10 

(2) INFORMATION FOR SEQ ID NO; 27 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO i 2*7 
GGCTGCAAAC 10 

(2) INFORMATION FOR SEQ ID NO; 28 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
CCTAGCGTTG 10 

(2) INFORMATION FOR SEQ ID NO: 29 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOi29 
GTAGCGGCCG CTGCCAGTGG A 21 

[2) INFORMATION FOR SEQ ID NQ:3Q 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(DJ TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 

GTAGCGGCCG CT 12 
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(2) INFORMATION FOR SEQ ID NO: 31 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1725 base pairs 

(B) TYPE: nucleic aci<3 

(C) STRANDEDNE55 ; single 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 31 

GCACGAGGGA GGCTGAAGCG TGCC ATG GGG CTC CCG GCG CTG CTG GCC AGT 51 

Met Gly Leu Pro Ala Leu Leu Ala Ser 
1 5 

GCG CTC TGC AGC TTC GTG CTG CCG CTG CTG CTC TTC CTG GCG GCG CTC 99 
Ala Leu Cys Thr Phe Val Leu Pro Leu Leu Leu Phe Leu Ala Ala Leu 
10 15 20 25 

AAG CTC TGG GAC CTG TAC TGT GTG AGC AGC CGC GAT CGC AGC TGC GCC 147 
Lys Leu Trp Aap Leu Tyr Cys Val Ser Ser Arg Asp Arg Ser Cys Ala 

30 35 40 

CTC CCC TTG CCC CCC GGT ACC ATG GGC TTC CCA TTC TTT GGG GAA ACA 195 
Leu Pro Leu Pro Pro Gly Thr Met Gly Phe Pro Phe Phe Gly Glu Thr 

45 50 55 

TTG CAG ATG GTG CTT CAG CGG AGG AAG TTT CTG CAG ATG AAG CGC AGG 24 3 

Leu Gin Met Val Leu Gin Arg Arg Lys Phe Leu Gin Met Lys Arg Arg 
60 65 70 

AAA TAC GGC TTC ATC TAC AAG ACG CAT CTG TTT GGG CGG CCC ACG GTG 291 
Lys Tyr Gly Phe He Tyr Lys Thr His Leu Phe Gly Arg Pro Thr Val 
75 80 85 

CGG GTG ATG GGC GCG GAT AAT GTG CGG CGC ATC TTG CTG GGA GAG CAC 339 
Arg Val Met Gly Ala Asp Asn Val Arg Arg He Leu Leu Gly Glu His 
90 95 100 105 

CGG TTG GTG TCG GTG CAC TGG CCC GCG TCG GTG CGC ACC ATC CTG GGC 38 7 

Arg Leu Val Ser Val His Trp Pro Ala Ser Val Arg Thr He Leu Gly 

110 115 120 

GCT GGC TGC CTC TCC AAC CTG CAC GAT TCC TCG CAC AAG CAG CGA AAG 435 
Ala Gly Cys Leu Ser Asn Leu His Asp Ser Ser His Lys Gin Arg Lys 

125 130 135 

AAG GTG ATT ATG CAG GCC TTC AGC CGC GAG GCA CTC CAG TGC TAC GTG 46 3 

Lys Val He Met Gin Ala Phe Ser Arg Glu Ala Leu Gin Cys Tyr Val 
140 145 150 

CTC GTG ATC GCT GAG GAA GTC AGC AGT TGT CTG GAG CAG TGG CTA AGC 531 
Leu Val He Ala Glu Glu Val Ser Ser Cys Leu Glu Gin Trp Leu Ser 
155 160 165 

TGC GGC GAG CGC GGC CTC CTG GTC TAC CCC GAG GTG AAG CGC CTC ATG 57 9 

Cys Gly Glu Arg Gly Leu Leu Val Tyr Pro Glu Val Lys Arg Leu Met 
170 175 100 195 
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TTC CGC ATC GCC ATG CGC ATC CTG CTG GGC TGC GAG CCG GGT CCA GCG 627 
Phe Arg He Ala Met Arg Il« Leu Leu Gly Cys Glu Pro Gly Pro Ala 

190 195 200 

GGC GGC GGG GAG GAC GAG CAA CAG CTC GTG GAG GCT TTC GAG GAG ATG 675 
Gly Gly Gly Glu Asp Glu Gin Gin Leu Val Glu Ala Phe Glu Glu Met 

205 210 215 

ACC CGC AAT CTC TTC TCT CTT CCC ATT GAC QTQ CCC TTT AGC GGC CTG 723 
Thr Arg Asn Leu Phe Ser L^u Pro lie Asp Val Pro Phe Ser Gly Leu 
220 225 230 

TAC CGG GGC GTG AAG GCG CGG AAC CTT ATA CAC GCG CGC ATC GAG GAG 771 
Tyr Arg Gly Val Lys Ala Arg Asn Leu He His Ala Arg He Glu Glu 
235 240 245 

AAC ATT CGC GCC AAG ATC CGC CGG CTT CAG GCT ACA GAG CCG GAT GGG 819 
Asn He Arg Ala Lys He Arg Arg Leu Gin Ala Thr Glu Pro Asp Gly 
250 255 260 265 

GGT TGC AAG GAC GCG CTG CAG CTC CTG ATT GAG CAC TCG TGG GAG AGG 867 

Gly Cys Lys Asp Ala Leu Gin Leu Leu lie Glu His Ser Trp Glu Arg 

270 275 260 

GGA GAG AGG CTG GAT ATG CAG GCA CTA AAA CAA TCG TCA ACA GAG CTC 915 
Gly Glu Arg Leu Asp Met Gin Ala Leu Lys Gin $er Ser Thr Glu Leu 

285 290 295 

CTC TTT GGT GGT CAT GAA ACT ACA GCC AGT GCT GCG ACA TCA CTG ATC 963 

Leu Phe Gly Gly His Glu Thr Thr Ala Ser Ala Ala Thr Ser Leu He 
300 305 310 

ACT TAC CTA GGA CTC TAC CCA CAT GTC CTC CAG AAA GTT CGA GAA GAG 1011 

Thr Tyr Leu Gly Leu Tyr Pro His Val Leu Gin Lys Val Arg Glu Glu 
315 320 325 

ATA AAG AGC AAG GGC TTA CTT TGC AAG AGC AAT CAA GAC AAC AAG TTA 1059 
He Lys Ser Lys Gly Leu Leu Cys Lys Ser Asn Gin Asp Asn Lys Leu 
330 335 340 345 

GAC ATG GAA ACT TTG GAA CAG CTT AAA TAC ATT GGG TGT GTC ATT AAG 1107 
Asp Met Glu Thr Leu Glu Gin Leu Lys Tyr He Gly Cys Val He Lys 

350 355 360 

GAG ACC CTG CGA TTG AAT CCT CCG GTT CCA GGA GGG TTT CGG GTT GCT 1155 
Glu Thr Leu Arg Leu Asn Pro Pro Val Pro Gly Gly Phe Arg Val Ala 

365 370 375 

CTG AAG ACT TTT GAG CTG AAT GGA TAC CAG ATC CCC AAG GGC TGG AAT 1203 
Leu Lys Thr Phe Glu Leu Asn Gly Tyr Gin Il« Pro Lys Gly Trp Asn 
380 385 390 

GTT ATT TAC AGT ATC TGT GAC ACC CAC GAT GTG GCA GAT ATC TTC ACT 1251 
Val He Tyr Ser lie Cys Asp Thr His Asp Val Ala Asp He Phe Thr 
395 400 405 
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AAC AAG GAG GAA TTT AAT CCC GAC CGC TTT ATA GTG CCT CAT CCA GAG 1299 
Asn Lys Glu Glu Phe Asn Pro Asp Arg Phe lie Val Pro His Pro Glu 
410 415 420 425 

GAT GCT TCC CGG TTC AGC TTC ATT CCA TTT GGA GGA GGC CTT CGG AGC 1347 
Asp Ala Ser Arg Phe Ser Phe lie Pro Phe Gly Gly Gly Leu Arg Ser 

430 435 440 

TGT GTA GGC AAA GAG TTT GCA AAA ATT CTT CTT AAG ATA TTT ACA GTG 1395 
Cys Val Gly Lys Glu Phe Ala Lya lie Leu Leu Lys He Phe Thr Val 

445 450 455 

GAG CTG GCT AGG CAC TGT GAT TGG CAG CTT CTA AAT GGA CCT CCT ACA 1443 
Glu Leu Ala Arg His Cys Asp Trp Gin Leu Leu Asn Gly Pro Pro Thr 
460 465 470 

ATG AAG ACA AGC CCC ACT GTG TAC CCT GTG GAC AAT CTC CCT GCA AGA 14 91 

Met Lys Thr Ser Pro Thr Val Tyr Pro Val Asp Asn Leu Pro Ala Arg 
475 480 485 

TTC ACC TAC TTC CAG GGA GAT ATC T GAT AGC TAT TTCAATTCTT 1535 
Phe Thr Tyr Phe Gin Gly Asp lie 
490 495 

GGACTTATTT GAAGTGTATA TTGGTTTTTT TTAAAAATAG TGTCATGTTG ACTTTATTTA 1595 

ATTTCTAAAT GTATAGTATG ATATTTATGT GTCTCTACTA CAGTCCCGTG GTCTTTAAAT 1655 

ATTAAAATAA TGAATTTGTA TGaTTTCCCA ATAAAGTAAA ATTAAAAAGT GAAAAAAAAA 1715 

AAAAAAAAAA 1725 



(2) INFORMATION f q r £ E q 1d tfQ:32 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 

Met Gly Leu Pro Ala Leu Leu Ala Ser Ala Leu Cys Thr Phe Val Leu 
15 10 15 

Pro Leu Leu Leu Phe Leu Ala Ala Leu Lys Leu Trp Asp Leu Tyr Cys 

20 25 30 

Val Ser Ser Arg Asp Arg Ser Cys Ala Leu Pro Leu Pro Pro Gly Thr 
35 40 45 

Met Gly Phe Pro Phe Phe Gly Glu Thr Leu Gin Met Val Leu Gin Arg 
50 55 60 

Arg Lys Phe Leu Gin Met Lys Arg Arg Lys Tyr Gly Phe lie Tyr Lys 
65 70 75 80 
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Thr His Leu Phe 



Val Arg Arg He 

100 

Pro Ala Ser Val 
115 

His Asp Ser Ser 
130 

Ser Arg Glu Ala 
145 

Ser Ser Cys Leu 



Val Tyr Pro Glu 

160 

Leu Leu Gly Cys 
195 

Gin Leu Val Glu 
210 

Pro He Asp Val 

225 

Asn Leu He His 



Arg Leu Gin Ala 

260 

Leu Leu He Glu 
275 

Ala Leu Lys Gin 
290 

Thr Ala Ser Ala 

305 

His Val Leu Gin 



Cys Lys Ser Asn 

340 

Leu Lys Tyr He 

355 

Pro Val Pro Gly 
370 



Gly Arg Pro Thr 
85 

Leu Leu Gly Glu 



Arg Thr He Leu 

120 

His Lys Gin Arg 
135 

Leu Gin Cys Tyr 
150 

Glu Gin Trp Leu 
165 

Val Lys Arg L$u 



Glu Pro Gly Pro 

200 

Ala Phe Glu Glu 

215 

Pro Phe Ser Gly 
230 

Ala Arg He Glu 
245 

Thr Glu Pro Asp 



His Ser Trp Glu 

280 

Ser Ser Thr Glu 
295 

Ala Thr Ser Leu 
310 

Lys Val Arg Glu 
325 

Gin Asp Asn Lys 



Gly Cys Val He 

360 

Gly Phe Arg Val 
375 



Val Arg Val Met 
90 

His Arg L<=u Val 
105 

Gly Ala Gly Cys 



Lys Lys Val He 

140 

Val Leu val He 

155 

Ser Cys Gly Glu 
170 

Met Phe Arg He 

185 

Ala Gly Gly Gly 

Met Thr Arg Asn 

220 

Leu Tyr Arg Gly 
235 

Glu Asn He Arg 
250 

Gly Gly Cys Lys 
265 

Arg Gly Glu Arg 



Leu Leu Phe Gly 

300 

He Thr Tyr Leu 
315 

Glu He Lys Ser 
330 

Leu Asp Met Glu 
345 

Lys Glu Thr Leu 



Ala Leu Lys Thr 

380 



Gly Ala Asp Asn 
95 

Ser Val His Trp 
110 

Leu Ser Asn Leu 
125 

Met Gin Ma Phe 



Ala Glu Glu Val 

160 

Arg Gly Leu Leu 
175 

Ala Met Arg He 
190 

Glu Asp Glu Gin 
205 

Leu Phe Ser Leu 



Val Lys Ala Arg 

240 

Ala Lys He Arg 
255 

Asp Ala Leu Gin 
270 

Leu Asp Met Gin 
285 

Gly His Glu Thr 



Gly Leu Tyr Pro 

320 

Lys Gly Leu Leu 
335 

Thr Leu Glu Gin 

350 

Arg Leu Asn Pro 
365 

Phe Glu Leu Asn 
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Gly Tyr Gin He Pro Lys Gly Trp Asn Val He Tyr Ser He Cys Asp 
385 390 395 400 

Thr His Asp Val Ala Asp He Phe Thr Asn Lys Glu Glu Phe Asn Pro 

405 410 415 

Asp Arg Phe He Val Pro His Pro Glu Asp Ala Ser Arg Phe Ser Phe 

420 425 430 

He Pro Phe Gly Gly Gly Leu Arg Ser Cys Val Gly Lys Glu Phe Ala 
435 440 445 

Lys He Leu Leu Lys He Phe Thr Val Glu Leu Ala Arg His Cys Asp 
450 455 460 

Trp Gin Leu Leu Asn Gly Pro Pro Thr Met Lys Thr Ser Pro Thr Val 
465 470 475 480 

Tyr Pro Val Asp Asn Leu Pro Ala Arg Phe Thr Tyr Phe Gin Gly Asp 

405 490 495 

He 



(2) INFORMATION FOR SEQ ID NO: 33 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 273 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 33 



CGCACCCCAG GAGGCGCGCT CGGAGGGAAG CCGCCACCGC CGCCGCCTCT GCCTCGGCGC 60 

GGAACAAACG GTTAAAGATT TTGGGCCASC GCCTCCGCGG GGGGAGGAGC CAGGGGCCCC 120 

AATCCCGCAA TTAAAGATGA ACTTTGGGTG AACTAATTGT CTGACCAAGG TAACGTGGGC 18 0 

AGCAACCTGG GCCGCCTATA AAGCGGCAGC GCCGTGGGGT TTGAAGCGCT GGCGGCGGCG 24 0 

GCAGGTGGCG CGGGAGGTCG CGGCGCGCCA TGG 273 



(2) INFORMATION FOR SEQ ID NO; 34 

fi) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 274 base pairs 
(fi) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 
CGCACCCCCA GGAGGCGCGC TCAGAGGGAA GCCGCCAGTG CGCCGCCTCT GCCTCGGCGC 60 
GGAACAAACG GTTAAAGATT TTTTTGGGCA GCGCCTCGAG GGGGGAGGAG CCAGGGGCCC 120 
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GATCCGCAAT TAAAGATGAA CTTTGGGTGA ACTAATTTGT CTGACCAAGG TAACGTGGGC 190 

AGTAACCTGG GCGGCCTTAT AAAGAGGGCG CGCGGCGGGG TTCGGAGCTA GGGAGGCGGC 240 

GGCAGGTGGC GCGGGAGGCT GAAGCGTGCC ATGG 274 

(2) INFORMATION FOR SEQ ID NO:35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 319 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 



TCGGGGGAAT TAACACCTTT TCAAAGTGAA ATCTCAGGAT TGTCTGCCTT CTACAGGAGG 60 

TGGTATTAAA ATGCGCCTAT AACAAATGGT TGAGAGTTTG GAGCCGCTTC TGCCCTGTGG 120 

GCGGGGCGAG ATGACACCAC AATTAAAGAT GAACTTTGGG TGAACTAATT TATCTGAGGA ISO 

AGTTAACAGG AGGAGACCTG CGCGCAATGG ATATATAAGG GCGCGCAGGC GAGGACGCCC 240 

TCAGTTTGTG CGTAAAGACG CGTCTCCTCT CCAGAAGCTT GTTTTTCGTT TTGGCGATCA 300 

GTTGCGCGCT TCAACATGG 319 



(2) INFORMATION FOR SEQ ID NO: 36 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2677 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS i single 

(D) TOFOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 



GATCCCAGAT CTGCCTATTG CGCCCGATGC CCCGAGGCTC TCTCTTGGAC TCTGGCCCTG 60 

AGTTCTTCTG CGCGATCCTT CGGAGACGTC TGGAGGCCTG CTTTATGCAT CTCTCTTGGA 120 

CCTCAGTTTC CCCACACGTG GGAGGAGGCA GCTGGACGAT TCCTGAAAGG ACTTTCCCTT 180 

GCTTCCTCAT CACGTGGAAG AGAGCCCACC CGGCACCTGG AAATGGAAAG CCAGTGAAGG 240 

CTGCTTTGGG CCGGGGCAKC GGGTGGGACC GGGCGGGAGG GATTCCAAAG AGACCGCCGG 300 

GAAGGCTAGA GCTTGGAATT CCGGCTCCTC GGAGTCCTGG CCCTCCCCCA CCGCCGCCTC 360 

GGAGCTCAGC ACACCTTGGA TGGGGGAGGC GGGCAGCTCC TAGCCCCGCA CCCCAGGAGG 420 

CGCGCTCGGA GGGAAGCCGC CACCGCCGCC GCCTCTGCCT CGGCGCGGAA CAAACGGTTA 4 80 

AAGATTTTGG GCCASCGCCT CCGCGGGGGG AGGAGCCAGG GGCCCCAATC CCGCAATTAA 540 

AGATGAACTT TGGGTGAACf AaTTGtCTCA CCAAGGTAAC GTGGGCAGCA ACCTGGGCCG 600 
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CCTATAAAGC GGCAGCGCCG TGGGGTTTGA AGCGCTGGCG GCGGCGGCAG GTGGCGCGGG €60 

AGGTCGCGGC GCGCCATGGG GCTCCCGGCG CTGCTGGCCA GTGCGCTCTG CACCTTCGTG 720 

CTGCCGCTGC TGCTCTTCCT GGCTGCGATC AAGCTCTGGG ACCTGTACTG CGTGAGCGGC 780 

CGCGACCGCA GTTGTGCCCT CCCATTGCCC CCCGGGACTA TSGGSTTCCC CTTCTTTGGG 840 

GAAACCTTGC AGATGNTACT NCAGGTAAGG GAGGGTGGGG CGGGACAGGC TGCTTCCCCG 900 

GAGCCCGGCG CGGCTCTGGG CTTCTGCTGA AGTCGGGGTA GGCGCCCCCG GGAGGCATGC 960 

TATTGCGGCT AGGAGCAGGG CTGGCGGGAG CGCGGCGCTC CCCGGMKYMC SCTCAWGCSC 1020 

RCWWKTMWCC TCCGCCTYMC TCCCAMAGCG GARSAARWKC YKGMRGATGA AGCGCAGGAA 1080 

ATACGGCTTC ATCTACAAGA CGCATCTGTT CGGGCGGCCC ACCGTACGGG TGATGGGCGC 114 0 

GGACAATGTG CGGCGCATCT TGCTCGGAGA GCACCGGCTG GTGTCGGTCC ACTGGCCAGC 1200 

GTCGGTGCGC ACCATTCTGG GATCTGGCTG CCTCTCTAAC CTGCACGACT CCTCGCACAA 1260 

GCAGCGCAAG AAGGTGGGGG CAGGAGGCGA CGGCTGGACA GGGAGGGGGA CCCCATTTAT 1320 

GAGCGGAATT CCGGCTGATG GATGCTAGGC GCGGGCTAGC AGCTTGAGGT GGGCTAGGAC 130 0 

CCTCTGCCAG CTCCAGGTTA GCTTTCCCAG CTCGGAGAGT GCCATGTGTC TGGCAGGACT 1440 

GGGGGTGTCT GGAAGGGGAC GGCGGTAGAC GAGAGGGGCG GATGGAGGCT TTTAACGCTG 1500 

TCCCCTCCTC GGGACTCAGG TGATTATGCG GGCCTTCAGC CGCGAGGCAC TCGAATGCTA 1560 

CGTGCCGGTG ATCACCGAGG AAGTGGGCAG CAGCCTGGAG CAGTGGCTGA GCTGCGGCGA 1620 

GCGCGGCCTC CTGGTCTACC CCGAGGTGAA GCGCCTCATG TTCCGAATCG CCATGCGCAT 166 0 

CCTACTGGGC TGCGAACCCC AACTGGCGGG CGACGGGGAC TCCGAGCAGC AGCTTGTGGA 1740 

GGCCTTCGAG GAAATGACCC GCAATCTCTT CTCGCTGCCC ATCGACGTGC CCTTCAGCGG 1800 

GCTGTACCGG GTAAGGGCGG CAAACGGGCT GCGGACTAGG GGCGCGGGAC CTGGGCGTCT 1860 

GCTCACCGCC GCGCGCTCTC TGCGCTCAGG GCATGAAGGC GCGGAACCFC ATTCACGCGC 1920 

GCATCGAGCA GAACATTCGC GCCAAGATCT GCGGGCTGCG GGCATCCGAG GCGGGCCAGG 1980 

GCTGCAAAGA CGCGCTGCAG CTGTTGATCG AGCACTCGTG GGAGAGGGGA GAGCGGCTGG 2040 

ACATGCAGGT GAGTAGCAGC TTCAGACCAG GCACTGCGGA GTTTGGTCCC CTGGCTTTCC 2100 

AAGGCGCTGT TCCTGGGGCC CCCAAAGCGC GCGCCTGGGG CCCAGCTTTC TGGAGTGGGC 2160 

GGCCGGCTCA GACTACAGCT ATGGAATCCC GAAGGAAGGC TGAGACACCC GGTCAGGAGA 2220 

GCTGCGGAAG GGGCTGCGGM GGAAACTGGG AGCATCCCCT AGCCTTTAMC AGGTTTCAAA 2280 
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GGGAAAGTTG 


GAATTTGCAA 


AAATGTTAAT 


AAAGAACCTT 


GCGATTTTAA 


TAAAACTAAG 


2340 


ACTTTAACTC 


AGGAGTTTCC 


GGTAGRGCGG 


GGTCGTACTC 


GCCTTACTGC 


TCCAGCTGAA 


2400 


CTAAAGGGAC 


GTTGCATTTT 


GTTTAAAGAT 


ATTGCTTTCC 


TTGACTTTCT 


GTCAGCAAAA 


2460 


CATTTAGCCC 


TTCTAGTCTT 


CGCTCCAGAA 


CTCTCAGTTC 


GATTCTGAGT 


AATCCTTCTG 


2520 


TCAAACCGCA 


GGCAGACTTG 


TGAGAATGTG 


GGTCTCACTC 


TATTCTTAGG 


CACTAAAGCA 


2580 


ATCTTCAACC 


GAACTCCTCT 


TTGGAGGACA 


CGAAACCACG 


GCCAGTGCAG 


CCACATCTCT 


2640 


GATCACTTAC 


CTGGGGCTCT 


ACCCACATGT 


TCTCCAG 






2677 



(2) INFORMATION FOR SEQ ID NO: 37 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(Ki) SEQUENCE DESCRIPTION: SEQ ID NO: 37 



GATCCAGGTT 


GCTGAAACAT 


ATCTCCATAT 


AGGGCAGAAC 


AATTATCAAA 


AGCATAAGAA 


GO 


TTGCAGCCAC 


AGCATAGGGA 


AGAAAGAGGA 


GTTTTTAAAC 


CACAACAAAA 


GGGAGAAAGA 


120 


AGAGAATTTT 


AACT TACATT 


TAATTCAAAA 


GTCTTCAGAG 


CAACCCGAAA 


CCCTCCTGGA 


180 


ACTGGGGGAT 


TCAGTCGAAG 


GGTCTCCTTA 


ATAACACACC 


CGATGTATYT 


AAGTTGTTCC 


240 


AAAATTTCCA 


TGTCCAACTT 


GTTGTCTTGA 


TTGCTCTTGC 


AAAGTAAACC 


CTAYCAAAAY 


300 


AGTCATACAG 


AGGTGAACAG 


TYATTTTGTG 


CTCCAATTAA 


AATCAGCCCA 


GCAGACGTAA 


360 


AGAGGGCTTA 


AGTGGAGACT 


AAACCCAAAG 


GGCCCCATGA 


TGGGAGAGAC 


TGGGAGGGGG 


420 


flAACAGCAGC 


TAATGGCCAT 


TTGCCTGCCC 


AAATCCACTA 


TCTATTTACA 


AT CCC AGGAG 


480 


AATGCTGCTC 


ACCAGTTAGA 


AGGACCAAGT 


TTCTCCCCAC 


GCCCCCCCAC 


CCCACACTCA 


540 


CCACCACCAC 


CCACACTAAT 


CAGCTATTCA 


CACTATGTAT 


GCCCTTGGAC 


AC ACCAAT T C 


600 


AAGAAAAGTG 


GAACCTATCT 


GAGAATCTCC 


ACGGTTCACA aaaaGGtgga 


GGAGGGGTAG 


660 


GAATACAAGG 


TCAAACCCTG 


ccc 








683 



(2) INFORMATION FOR SEQ ID NO; 38 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4164 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:36 
TCGCGAGGAG CGACCACGGC TTGAAGAGGG GTAGACGAGA CCAGATGCTC 
CCTCATGCGG GTTGCGGTCT CTCTCCTCCA CCTCCCTCTC AGCGGAGGAA 
ATGAAGCGCA GGAAATACGG CTTCATCTAC AAGACGCATC TGTTTGGGCG 
CGGGTGATGG GCGCGGATAA TGTGCGGCGC ATCTTGCTGG GAGAGCACCG 
GTGCACTGGC CCGCGTCGGT GCGCACCATC CTGGGCGCTG GCTGCCTCTC 
GATTCCTCGC ACAAGCAGCG AAAGAAGGTG AGGGTGAGCT GGCAACTCCT 
GGAGACCTCA TCCTATGGCT TGGTTCAGGC AAAATAGAAT GCGGGGCGAG 
ATGTGGTGGG GACCAGGACC CTCTCTATCT GAGATCCACT TTAGCTTTTC 
TGGGTTAGTC CTGGGGGGGA CTGAAATTCT TGAAAGGGTA CTCGGAAAGG 
GGGGCTGAGG GAAAGTAGAG GATTGTAACA CTCTCTGCTC CTGGGGGGTG 

TATGCAGGCC TTCAGCCGCG aggcactcca gtgctacgtg cccgtgatcg 
cagcagttgt ctggagcagt ggctaagctg cggcgagcgc ggcctcctgg 
ggtgaagcgc ctcatgttcc gcatcgccat gcgcatcctg ctgggctgcg 

AGCGGGCGGC GGGGAGGACG AGCAGCAGCT CGTGGAGGCT TTCGAGGAGA 
TCTCTTCTCT CTTCCCATTG ACGTGCCCTT TAGCGGCCTG TACCGGGTAA 
CGGAGTCGGA GTAGGGGAAC GCAAGCTCGG GCATCCGCTC ACCGCCACGC 
CTCAGGGCGX GAAGGCGCGG AACCTTATAC ACGCGCGCAT CGAGGAGAAC 
AGATCCGCCG GCTTCAGGCT ACAGAGCCGG ATGGGGGTTG CAAGGACGCG 
TGATTGAGCA CTCGTGGGAG AGGGGAGAGA GGCTGGATAT GCAGGTGAGA 
AAAGGTGCCA AGGGCCGGGG AGTGCCTCTG ACTTTCCAGA CACACTTTCT 
AAGCCCTGTC AAGGCCCCAG CTACTTCCAA GTGGGCGGCG ATGCTAGGTC 
CAACCTGTGG GTCGTGACCC CTTCACGGAG CCAAACAACC CTTTCAGAAG 
GAGCATCTGC ATATCCGATA TTTACATCAA GAAACATAAC AGTAGCAAAA 
GAAGTAGCAA CAAAGATAAT TTTATCGTTG GGGGTCACCA CAACACGAGG 
AAGGGTGGCA TTGGTCTAGA GAGCTGTGGA AGGGGGTGGC TGAGCAATGG 
AAAGTTCAAA GGGCAAGGCT CATCTACAAA GGTTAAAGCG GAAGAGCAGG 
TTTTGCGTTT TTGTTTGTGG TCTTTGACTT TCTATGAACA AAACGGATTT 
GTCTTCCGTG CAATATTCTC AGGTCAGGTC TTTGTAACAG TGCTATAAAC 



416 8S5 7380 P. 



CCCGGCGCCC 60 

GTTrCTGCAG 120 

GCCCACGGTG 1B0 

GTTGGTGTCG 24 0 

CAACCTGCAC 300 

TGGCTGGCAG 360 

GGCTAGTCCT 420 

TGCTAGCACG 480 

CGAAGGGGGG 540 

CTCAGGTGAT 600 

CTGAGGAAGT 660 

TCTACCCCGA 720 

■ 

AGCCGGGTCC 7S0 

TGACCCGCAA 640 

GGGCGGTTTG 900 

TCTCTCCGCG 960 

ATTCGCGCCA 1020 

CTGCAGCTCC 1080 

AGCAATTTCA 114 0 

GGGGTCTCCA 1200 

TAGAGCTTTT 1260 

GGTCGCCTAA 1320 

TTACCGTTAT 136 0 

AACCGTATTA 144 0 

GGAAGATCCC 1500 

ATTAAGGGAG 1560 

TACCCTTGAA 1620 

TGCACTCAGA 168 0 
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TCTGTATAAA CTTCCGTTTT TATCCTTAGG CACTAAAACA ATCGTCAACA GAGCTCCTCT 1740 

TTGGTGGTCA TGAAACTACA GCCAGTGCTG CGACGTCACT GAT C ACT TAG CTAGGACTCT 1800 

ACCCACATGT CCTCCAGAAA GTTCGAGAAG AGATAAAGAG CAAGGTAGGA TGATTCTAGA I860 

GGTTCCCCAT TTGCCTAGGA CATTCCTCTA TTA&CCACCA CCACCACCCC CACTGTATAT 1920 

AAGTTTGCTC GATACACCCA GTACTATGAC AGTGAAGATC TGAGAGCTAG GTGGGACTGT I960 

GGGGGAGAGA CTCCACCTCG TGAATTTAAA AAGGCAGTTG TTTGTACTGG GCTCTCTCTT 204 0 

GGGCAGAATT TGACCCTCTC CTCCTCCTCC TCCTCCTCCT CCTCTTCCTC CTCCACCACC 2100 

ACCACCATCA CCACCTTTTA TAGAGCAAGG TTCTCCTTTC CCTGACCAAG AACATGAATA 2160 

ATGTGATTAG AGCCAATAGC TGATCAGGGT CGCAGTGTTG GTGAGGGCTC AGGGTATGAC 2220 

CCTTTATATA CCTGATAAGC AACATTGTCT GGATAATGGG TTTAGGCTGA GGAAGTGTGG 2290 

AAAGGAAGGC CAT CAGGCCA TCAGCTCTTT CCCTTTTATC CTCTCCCATC CAGACGCCTT 2340 

CAGGTTTAGT TAACAGGTGA GTCCTGCTGG GCTGACTTTT TTTTTGGAGT GCCCAGGGA1 1 2400 

CCATCACTCA CTTTTTTATC TGTTTCCATA GGGCTTACTT TGCAAGAGCA ATCAAGACAA 2460 

CAAGTTAGAC ATGGAAACTT TGGCACAGCT TAAATACACT GGGTGTGTCA TTAAGGAGAC 2520 

CCTGCGATTG AATCCTCCGG TTCCAGGAGG GTTTCGGGTT GCTCTGAAGA CTTTTGAGCT 2590 

GAATGTGAGT GCACCTCCTG TCCCCCACCC CCAGCCCTCG TCCACGTCCA CTCTGCTATG 2 640 

CTGTTGAGCA TCAGCTGCCC AGAGCAGTGG CTGACTGCCC TTGACAGTGT CCTGCCTCCT 2700 

ATGGTACTGG GAACCAATTT GCTCTCCTCT CTTAATGCCA TCCATGCTAG TAATGACTTT 2760 

TTGTTGTTGC AAGCTCAGGG CCGGGATTGT CAATTCTTAG GATTTTTTTT TTTTTTTAAA 2820 

CAGGGATACC AGATCCCCAA GGGCTGGAAT GTTATTTACA GTATCTGTGA CACCCACGAT 2880 

GTGGCAGATA TCTTCACTAA CAAGGAGGAA TTTAATCCCG ACCGCTTTAT AGTGCCTCAT 294 0 

CCAGAGGATG CTTCCCGGTT CAGCTTCATT CCATTTGGAG GAGGCCTTCG GAGCTGTGTA 3000 

GGCAAAGAGT TTGCAAAAAT TCTTCTTAAG ATATTTACAG TGGAGCTGGC TAGGCACTGT 3060 

GATTGGCAGC TTCTAAATGG ACCTCCTACA ATGAAGACAA GCCCCACTGT GTACCCTGTG 3120 

GACAATCTCC CTGCAAGATT TACCCACTTC CAGGGAGATA TCTGATAGCT ATTTCAATTC 31B0 

TTGGACTTAT TTGAAGTGTA TATTGTTTTT TTTAAAATAG TGTCATGTTG ACTTTATTTA 324 0 

ATTTCTAAAT GTATAGTATG ATATTTATGT GTCTCTACTA CAGTCCCGTG GTCTTAAATA 3300 

TTAAAATAAT GAATTTGTAT GATTTCCCAA TAAAGTAAAA TTAAAAAGTG CTTCTCTTGC 3360 
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TTTTTAAGAT TCTTGTTGGC AAGCTGCCCA TGGTGGTACA TTGCTGTAAT ACTAGGACTT 3420 

GGAAGGTGGA GGCAAGAAGA GCAGGCATTC AAGGCTAGCC TGGGCTACAG AAATCCTGTC 3480 

TTAAACAAAC ACTACAACAA AAAGTCCTGT TAGGGAATCT GACTGGCTCA GTGTTTGTAC 3540 

TTTGTGTATT TAAAATGATT TAGAGTGAAA CCATAGGTCT CTCCCCCATG TCAGAAAATA 3600 

T AT AT TAT T A TGTGTATGCT GATCCAAAGT ATCTTTGTAA CTTTTTCTAA GGTCATTGAG 3660 

ACTTCATATT TTGAAATTGT ATGGAGGCTA GTTATATTAC ATTATTTATT TATTTATTTA 3720 

TTTACATTTT TATGGTGCTG GGGATTGGAT CGAAGGCTTC ACACCTCTAG GGCAAGCCCT 379 0 

TTGTCATTAA GGCGCTGCCT CTCCCTTTCA GCCCAACGTT AATTCTAGAT TCTTTTTCTT 3840 

TGGTGCTTTT GGGAGGTAAA CCTGGGATGC TGCAGTTATT TGGTGGTGGT CGTTGGTTTT 3900 

ACTCTAGAGA GAAGGCAACT TTGGGAAGGC AACACTGCTG CTGGTGAGTC GGGAAGCATC 3960 

ATCCCAGAGC AACGGGGTCA GCATAGCTAA CATTTTAAAT CAGCATAATG AATCCCTGTC 4020 

ATATGGAGGA GGCAGAACXC CTCTTTGAAG TTGATATTTT AGATAAGACA GAGCCAGCCC 4080 

CTCTGGTTAT GGACAGTTCT TACCCAAAAT GAAACAGAGA AGAAAACCAC TGGTGTGTCA 414 0 

CCTTTCCTTA GAAGTGCTTC AGGA 4164 

(2} INFORMATION FOR 5EQ ID NO: 39 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE i nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix> FEATURE: 

(D) OTHER INFORMATION: Each N can represent any nucleotide 

and there can be 0 to 5 N 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 

TGAACTNNNN NTGAACT 17 



(2) INFORMATION FOR SEQ ID NO: 4 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) StrandednesS; single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 
TCTGASSAAG KTAAC 15 
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(2) INFORMATION FOR SEQ ID NO: 41 

(i) SEQUENCE CHARACTERISTICS J 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION ! SEQ ID NOHl 
CAATTAAAGA 10 

(2) INFORMATION FOR SEQ ID NO: 42 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 42 
CAATTAAAGA TGAACTTTGG GTGAACTAAT T 31 

(2) INFORMATION FOR SEQ ID NO: 4 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: $EQ ID NQM3 

GTAGCACGGA TGGTG 15 
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